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THE   UNIVERSITY   OF   NORTH  CAROLINA 

CHAPEL  HILL 


COMPUTATION  CENTER 


To  :     All  Faculty  Members 

From      :     Director,  Computation  Center 

Subject:     Computation  Center  Program  and  Systems  Library 


The  Computation  Center  was  established  in  order  to  provide  the 
University  with  the  capabilities  and  the  power  of  a  modern, 
large-scale  digital  computer  for  use  in  research  and  education. 
The  enclosed  description  of  the  program  and  systems  library  of 
the  Center  will  assist  faculty  members  in  becoming  acquainted 
with  the  computer's  capabilities  and  possible  applications  in 
their  fields  of  interest.     It  is  hoped  that  this  will  lead  to 
a  more  effective  use  of  the  Center  facilities. 


We  suggest  that  you  read  the  introduction  in  order  to  gain  an 
understanding  of  the  problems  involved  in  using  the  computer. 
The  remaining  sections,  which  consist  of  the  individual  program 
and  systems  descriptions,  may  then  be  scanned  for  items  of  par- 
ticular interest  or  application. 

As  additional  programs  are  added  to  the  Center  library,  supple- 
mentary sheets  for  Parts  1  and  II  of  this  publication  will  be 
periodically  distributed  to  interested  parties.     If  this  infor- 
mation is  of  value  in  your  work  and  if  you  would  like  to  be  on 
the  distribution  list  for  future  supplements,  remove  the  address 
sheet  on  this  material,  fold  and  staple  it  so  that  the  Computa- 
tion Center  address  on  the  reverse  side  is  visible,  and  return 
it  to  the  Center  by  campus  mail. 


W.  Hanson 
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DESCRIPTION  OF  THE  PROGRAM  AW  SYSTEMS  LIBRAi^Y 

COMPUTATION  CENTER 
UNIVERSITY  OF  NORTH  CAROLINA 


Chapel  Hill 
1963 


INTRODUCTIOH 


This  publication  presents  a  brief  description  of  the  programming 
facilities  that  are  available  at  the  Computation  Center.     It  is  hoped 
that  this  material  will  better  acquaint  faculty  members  with  the  compu- 
ter's capabilities  and  demonstrate  its  applicability  to  many  nex^j  problem 
areas  under  investigation  throughout  the  University.     It  is  through  a 
broadening  of  the  fields  of  application  of  the  computer  that  the  Compu- 
tation Center  can  properly  fulfill  its  purpose  which  is  to  provide  this 
new  and  powerful  tool  for  the  research  and  educational  activities  of  the 
University. 

In  order  to  solve  any  specific  problem  or  perform  a  desired  function 
with  an  automatic  digital  computer,  it  is  necessary  to  prepare  a  set  of 
detailed  instructions  known  as  a  computer  program.     These  instructions 
must  be  in  a  form  which  will  allow  the  computer  to  accept  and  execute 
them  --  a  form  known  as  "machine- language  coding."    As  can  be  expected, 
the  format  of  such  coding  differs  widely  from  the  language  of  the  original 
problem.     The  exact  characteristics  of  any  machine- language  program  are 
a  function  of  the  particular  computer  upon  which  the  program  will  be  exe- 
cuted.    Codes  are  frequently  all  numeric  or  may  be  mixtures  of  alphabetic 
and  numeric  characters.     Often  an  arithmetic  base  other  than  the  common 
decimal  base  is  used  in  these  instructions. 

An  example  of  a  group  of  machine -language  instructions  taken  from  a 
UWIVAC  1105  computer  program  is  given  belo^^7: 

00145  11     32000  04167 

00146  21    04160  07272 

00147  43  13614  00010 
00150    13    041&7  01351 


Besides  the  requirement  that  a  program  be  coded  in  mac , vine  language, 
it  is  necessary  that  every  character  in  the  coding  be  absolutely  perfect 
before  the  program  can  correctly  perform  its  intended  function.  Finally, 
when  one  considers  that  a  typical  computer  program  may  consist  of  from 
200  to  2000  (or  even  more)  machine- language  :*nstructions,   it  is  clear 
that  the  task  of  writing  a  correctly  coded  program  becomes  truly  immense. 

Unfortunately,  due  to  a  limit  in  resources,   the  Computer  Center  is 
not  able  to  maintain  a  large  staff  of  professional  programmers  vrhose 
duties  would  be  the  individual  preparation  of  such  computer  programs  for 
all  University  users.     Tlierefore,   in  order  to  make  the  computer  as  read- 
ily accessible  as  possible  to  the  general  user  while  operating  with  a 
limited  programming  staff,  the  following  has  been  done. 

First,  a  large  number  of  general-purpose  programs  covering  a  wide 
range  of  problem  areas  have  been  made  available  to  computer  users  through 
the  establishment  of  a  program  library.     An  example  of  one  such  program 
is  the  Analysis  of  General  Linear  Model  program  which  will  perform  any 
of  the  usual  computations  desired  in  analyzing  a  specific  multiple  regres- 
sion, correlation,  or  analysis  of  variance  problem  of  linear  models. 
The  li-brary  has  been  developed  by  the  programming  staff  of  the  Center 
and  through  the  program  exchange  facilities  maintained  by  the  UNIVAC 
Scientific  Exchange,  the  UNIVAC  1105  computer  users'  organization. 

Secondly,  the  programming  problem  has  been  greatly  simplified  by 
the  development  of  various  problem-  or  user-oriented  automatic  program- 
ming systems  for  the  UNIVAC  1105.     In  the  last  five  years  much  progress 
has  been  made  in  transforming  the  process  of  preparing  correctly  coded 
programs  from  a  time-consuming  manual  method  requiring  considerable  train- 
ing, to  a  partially  automatic  procedure  employing  the  computer  itself 
to  perform  much  of  the  tedious  and  error-prone  generation  of  machine- 
language  instructions. 


An  automatic  programming  or  automatic  coding  system  consists  of  (1) 
a  prescribed  problem-oriented  language  in  which  the  analyst  phrases  his 
problem  statement  and  solution  procedure,  and  (2)  a  complex  computer  pro- 
gram which  has  been  written  by  professional  programmers  to  translate  this 
statement  of  the  problem  into  a  set  of  machine- language  instructions  for 
subsequent  solution  by  the  computer. 

Consider  the  field  of  scientific  programming,  for  example.  Because 
most  scientific  and  engineering  problems  are  phrased  in  mathematical 
terms,  a  number  of  algebraic  programming  systems  or  "algebraic  compilers" 
have  been  devised.    Use  of  an  algebraic  programming  system  permits  the 
analyst  to  phrase  his  problem  in  an  easily  learned  language  involving  (1) 
algebraic  expressions,  common  mathematical  functions,  roots,  powers,  ine- 
qualities, etc.,   (2)  statements  relating  to  the  introduction  of  numerical 
data  and  the  generation  of  numerical  results,  and  (3)  logical  decision 
statements  which  direct  the  process  of  solution. 

Translation,  by  the  computer,  of  a  program  written  in  this  problem- 
oriented  language  to  a  machine- language  program  is  frequently  referred 
to  as  "compilation,"  hence  the  term  --  algebraic  compiler  (or  translator). 

Among  the  many  advantages  of  employing  an  automatic  programming 
system  for  most  problem  areas  are  the  following: 

1.  Short  programmer  training  periods. 

2.  Ability  to  phrase  problems  in  terms  of  familiar  problem-oriented 
notation. 

3.  Shorter  program  preparation  times. 

4.  Reduction  of  programming  errors. 

5.  Error  indications  as  part  of  the  compilation  process. 
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This  report  is  divided  into  three  sections: 

Part  I  provides  a  very  brief  description  of  the  computer  facilities 
of  the  Center. 

Part  II  describes  the  various  available  automatic  programming  systems 
and  their  general  areas  of  application. 

Part  III  describes  the  major  general-purpose  computer  programs  in 
the  Center  program  library.     These  programs  are  available  for  immediate 
use . 


PART  I 
THE  COMPUTER  FACILITIES 

In  August,  1959,  the  Consolidated  University  of  North  Carolina  in- 
stalled a  new  UNIVAC  Scientific  1105  Computer  on  campus.     This  development 
was  made  possible  by  the  cooperation  and  generosity  of  the  State  and  Fed- 
eral governments.  Remington  Rand  UNIVAC,  and  the  National  Science  Foundation. 
On  July  1,  1962,  the  Computation  Center  was  placed  under  the  administra- 
tion of  the  Chancellor  of  the  University  of  North  Carolina  at  Chapel  Hill. 
The  Center  operates  under  a  Director  who  reports  to  the  Dean  of  the  Grad- 
uate School. 

The  UNIVAC  1105  is  a  "large-scale"  automatic  digital  computer.  It 
is  capable  of  performing  approximately  30,000  additions  and  subtractions 
per  second;  approximately  4,000  multiplications  per  second;  and  approxi- 
mately 2,000  divisions  per  second.     It  has  8,192  words  of  magnetic  core 
storage  with  an  access  time  of  8  microseconds,  and  over  32,000  words  of 
directly  accessible  magnetic  drum  storage  with  an  average  access  time  of 
17  milliseconds. 

The  input/output  equipment  on  the  1105  consists  of  seventeen  UNISERVO 
magnetic  tape  read-write  units,  a  2000  card-per-minute  punched-card  reader, 
and  high-speed  punched  paper  tape  read  and  punch  units.     Hard  copy  list- 
ings of  computer  output  are  printed  on  an  off-line  high-speed  printer  which 
has  a  printing  speed  of  600  lines  per  minute.     Complete  buffering  allows 
independent  operation  of  the  main  computer  and  the  magnetic  tape  units 
which  makes  the  computer  very  efficient  on  input/output  operations. 

The  Center  also  maintains  an  input  preparation  room  where  programs 
and  data  are  prepared  for  the  computer  by  transferring  the  information  to 
either  punched  cards  or  punched  paper  tape. 


PART  II 


PROGRAMMING  SYSTEMS 


NC  LA  02  -  The  USE  III  Compiler 

USE  III  is  the  standard  assembly  program  for  the  UNIVAC  1103A  and 
1105  computers.     It  was  developed  under  the  sponsorship  of  the  UNIVAC 
Scientific  Exchange. 

An  assembly  language  program  is  only  one  step  removed  from  a  machine 
language  program  and  is  normally  employed  by  only  the  more  experienced 
programmer.     Even  then  its  use  is  restricted  to  production- type  programs 
in  which  it  is  imperative  that  the  most  efficient  program  be  written. 

Programming  information  for  this  system  is  to  be  found  in  the  USE 
Compiler  General  Reference  Manual  UP-3844,  Sperry  Rand  Corporation,  1963. 


NC  LB  01  -  Internal  Translator  (IT) 

The  IT  Compiler  is  a  UNIVAC  1105  version  of  the  first  automatic  cod- 
ing system  ever  written  for  a  computer.     It  is  an  algebraic  language  pri- 
marily intended  for  scientific  and  engineering  type  problems.  This 
particular  system  contains  a  large  library  of  subroutines  designed  for 
statistical  type  problems  and  is  most  convenient  for  the  programming  of 
problems  in  this  category  which  cannot  be  handled  by  one  of  the  general 
statistical  library  programs. 

Though  not  designed  for  the  programming  of  non-numeric  problems, 
this  system  can  be  used  to  advantage  in  this  area. 

Programming  information  for  this  system  can  be  found  in  the  IT  Manual, 
Computation  Center,  University  of  North  Carolina, 


NC  LB  02  -  Generalized  Algebraic  Translator  (GAT) 

GAT  is  an  easily  learned  algebraic  programming  language  and  a  direct 
successor  of  the  IT  Language.     It  offers  conveniences  and  efficiencies  not 
available  in  the  IT  Language  and  is  reconmiended  over  that  language  for  all 
problem  areas  except  statistics.     The  GAT  subroutine  library  is  still  under 
development  and  does  not  have  the  sophisticated  statistical  subroutines 
ctirrently  available  in  the  IT  System. 

Though  basically  a  numeric  programming  language,  this  system  has  wide 
application  to  non-numeric  problems  due  to  the  availability  of  special 
alphanumeric  subroutines. 

Programming  information  for  this  system  is  available  in  the  General- 
ized Algebraic  Translator  Manual,  Computation  Center,  University  of  North 
Carolina,  1962. 


NC  LB  07  -  Information  Processing  Language  V  (IPL-V) 

IPL-V  is  an  interpretive  system  developed  hy  the  Raad  Corporation. 
Its  basic  application  is  to  heuristic- type  problems  which  involve  the  man- 
ipulation of  long  and  complex  lists  of  data.     Examples  of  such  problems 
are  the  playing  of  chess,  mechanical  theorem  proving,  learning  models, 
simulation,  etc.     The  language  is  completely  symbolic  and  cannot  be  class- 
ified in  a  general  category  such  as  algebraic,  English,  etc.     IPL-V  is 
built  around  the  concept  of  a  "linked  list."    The  operations  of  IPL  are 
all  designed  to  operate  on  such  lists  and  provide  convenient  methods  for 
making  insertions  in  a  list,  deletions  to  a  list,  combining  lists,  making 
lists  of  lists,  etc. 

Programming  information  for  IPL-V  can  be  found  in  Programming  Note 
No.  87  and  in  the  Information  Processing  Language  V  Manual,  Sections  I 
and  II,  Rand  Corporation,  1960. 


NC  LB  08  -  Ctenon  Business  Oriented  Language  (COBOL) 

In  April,  1960,  the  cooperative  efforts  of  a  group  composed  of  compu- 
ter users  in  industry,  the  Department  of  Defense  and  other  Federal  Govern- 
ment agencies  and  computer  manufacturers,  culminated  in  the  publication 
of  the  initial  specifications  for  a  computer  programming  language.  This 
language  was  designed  specifically  for  use  with  a  class  of  problems  often 
called  "business  data  processing"  as  opposed  to  the  group  of  problems  con- 
sidered "scientific  data  processing"  and  was  called  COBOL,  for  COmmon  Busi- 
ness Oriented  Language. 

A  version  of  COBOL  is  available  on  the  UNIVAC  1105.     COBOL  is  basi- 
cally an  English  language  compiler  with  the  statements  specifically 
designed  for  stating  accounting  problems,  payroll  problems,  inventory 
problems,  etc. 

Information  on  the  particular  version  of  COBOL  available  on  the 
UNIVAC  1105  can  be  found  in  Programming  Note  No.  112  and  in  the  COBOL 
Manual,  U.  S,  Department  of  Commerce,  Bureau  of  the  Census,  September, 
1962. 

A  more  general  reference  on  COBOL  is:  Department  of  Defense,  Report 
to  Conference  on  Data  Processing  Languages,  Supt.  of  Documents,  Wash.  25, 
D.  C. 
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PART  III 


GENERAL  LIBRARY  PROGRAMS 


NC  DE  01  -  Analytic  Differentiation  Program 

Ti;e  analytic  differentiation  program  is  designed  to  generate  the 
symbolic  derivatives  of  any  algebraic  or  transcendental  functions.  That 
is,   if  an  expression  such  as, 

2 

ax    +  bx  +  sin  cx  , 

is  input  as  program  data  the  program  will  generate  any  order  derivative 
requested.     If  the  first  order  derivative  xoere  requested,  the  output 
would  then  be 

2ax  +  b  +  c  cos  cx 

The  program  is  capable  of  generating  as  high  an  order  of  derivative 
as  desired,  and  partial  derivatives  of  designated  dependent  variables 
with  respect  to  a  maximum  of  frve  independent  variables.     The  variable 
of  differentiation,  the  input  expresrion,   the  derivative  expression,  and 
the  order  are  output  for  each  designated  variable  and  for  each  order  of 
differentiation  specified. 

Additional  detailed  information  can  be  found  in  Programming  Note  No. 
91  and  the  following  general  reference. 

Hanson,  J.  W. ,  J.  S.  Caviness  and  C.  Joseph,   "Analytic  Differentia- 
tion by  Computer."    Comm.  ACM  5  (June  1962). 


A     : -rd  +  '--i 


NC  EB  04  -  Time  Series  Decomposition  and  Adjustment  Program 

The  Time  Series  Decomposition  and  Adjustment  program  is  designed  to 
adjust  seasonal  and  irregular  time  series  to  a  form  that  shox^7S  primarily 
the  trend-cyclical  movements.     Seasonal  factors,   irregular  fluctuations 
and  many  summarj'  measures  useful  in  time  series  analysis  are  computed  in 
the  process.     This  program  contains  the  following  options; 

1.  Delivery-day  adjustment  may  be  made. 

2.  A  24-month  forecast  may  be  obtained. 

3.  Only  specified  tables  will  be  output. 

This  program  is  the  result  of  a  translation  by  the  FORTRAN- to- GAT 
translator  of    ER  TSDA  (ESSO  Research,  Time  Series  Decomposition  and  Ad- 
justment Program)  written  for  an  IBM  704.     ESSO  Research  added  to  earlier 
TSDA  programs,  provisions  to  permit  (1)  adjusting  the  original  data  for 
the  effect  of  the  number  of  delivery  days  based  on  values  supplied  by  the 
user  and  (2)  the  fitting  of  a  least-squares  trend  line  to  the  adjusted 
data  as  an  aid  in  forecasting. 

Additional  information  may  be  found  in  Programming  Note  No.  80.  and 
the  following  general  reference: 

Shiskin,  Julius,  Electronic  Computers  and  Business  Indicators,  Occa- 
sional Paper  57,  National  Bureau  of  Economic  Research,  Inc.,  1957. 


NC  EB  05  -  Time  Series  Decomposition  and  Adjustment 

This  is  the  time  series  decomposition  and  adjustment  program  current- 
ly employed  by  the  Bureau  of  the  Census.  (Nov.,  1963)  It  incorporates  the 
X-9  and  X-10  versions.  It  is  similar  to  NC  EB  04  except  that  this  earlier 
series  is  the  X-3  version. 

NC  EB  05  is  designed  to  adjust  seasonal  and  irregular  time  series  to 
a  form  that  shows,  primarily,  the  trend-cyclical  movements.     Seasonal  fac- 
tors, irregular  fluctuations  and  summary  measures  useful  in  time  series 
analysis  are  also  computed  in  the  process.     In  addition  to  tabular  output, 
charts  which  visually  indicate  the  irregular  fluctuations  present  in  time 
series  are  available  as  optional  output. 

Additional  operational  information  for  this  routine  can  be  found  in 
Programming  Note  No.  113.  In  addition  to  the  general  reference  given  in 
the  description  of  NC  EB  04  the  following  references  might  be  of  interest. 

Shiskln,  J.,  "Tests  and  Revisions  of  Bureau  of  the  Census  Methods  of 
Seasonal  Adjustments,"  Technical  Paper  No.  5,  U.  S.  Dept.  of  Commerce, 
Bureau  of  the  Census,  Wash.,  D.  C,  1961. 

"Specifications  for  the  X-10  Version  of  the  Census  Method  II  Seasonal 
Adjustment  Program,"  Office  of  the  Chief  Economic  Statistician,  Bureau  of 
the  Census,  March  6,  1962. 


NC  FB  06  -  Eigenvalues  and  Eigenvectors  of  a  Symmetric  Matrix 

This  program  will  determine  the  eigenvalues  and  eigenvectors  of  a  real 

symmetric  matrix.     In  doing  so,  the  real  symmetric  matrix  (A)  is  diagonal- 

ized  by  performing  a  sequence  of  orthogonal  transformations,  each  designed 

to  reduce  one  off-diagonal  element  to  zero.     In  a  matrix  of  order  n 
2 

there  are     (n    -  n)/2    off-diagonal  elements  (only  the  lower  triangular 

2 

part  of    A    is  read  in).     More  than    (n    -  n)/2     transformations  will  be 
required  since  an  element,  once  reduced  to  zero,  will  not  necessarily 
remain  zero.     However,  each  transformation  will  reduce  the  sum  of  squares 
of  off-diagonal  elements  (SSOD)  by    ^a^^^,     where    a^^^    is  the  absolute 
value  of  the  element  reduced  to  zero.     In  a  finite  number  of  transforma- 
tions,  SSOD  can  be  reduced  to  an  arbitrarily  small  value. 

The  input  data  and  control  parameters  are  on  paper  tape.     The  output 
consist  of  the  following: 

1.  The  sum  of  squares  of  off -diagonal  elements  and  an  error  term. 

2.  The  eigenvector  matrix  and  the  eigenvalues  . 

3.  (Optional)     The  eigenvalues  and  the  eigenvectors,  normed  to  their 
corresponding  eigenvalues  and  arranged  in  order  of  decreasing  magni- 
tudes of  the  eigenvalues,  output  as  a  binary  format  magnetic  tape 
which  can  be  used  as  input  to  NC  GF  01  -  Varimax  Rotation. 

This  program  will  handle  a  matrix  of  maximum  order  65. 
Additional  information  may  be  found  in  Programming  Note  No.  60. 


NC  FD  08  -  Matrix  Inversion  and  Linear  Systems  Program 

The  matrix  inversion  and  linear  systems  routine  is  a  preliminary  ver- 
sion of  a  program  that  computes  the  inverse  of  a  matrix    A    and  uses  this 
inverse  to  solve  the  linear  system    AX  =  B  ,    where    A    is  an    n  x  n 
matrix  and    B    and    X    are    n  x  m  matrices. 

The  Jordan  elimination  method  is  used  in  this  program  with  the  larg- 
est   remaining  diagonal  element  used  as  a  pivot. 

The  input  data  and  parameters  to  this  program  are  on  paper  tape. 
The  output  is  controlled  by  the  input  parameters.     The  following  output 
may  be  obtained: 

1.  The  A-matrix. . 

2.  The  A- inverse  matrix. 

3.  The  inverse  check. 

4.  The  B-matrix. 

5.  The  solution,  X  . 

6.  A    times  the  solution,  X     (AX)  . 

This  routine  is  restricted  to  an    n  x  n    A-matrix  where    n  ^  45  and 
an    n  X  m    B-matrix  where    n,  m  ^45. 

Additional  information  may  be  found  in  Programming  Note  No.  78. 
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NC  GA  09  -  Contingency  Table  Analysis 

This  program  is  designed  to  furnish  an  analysis  of  questionnaire  data 
on  punched  cards.     It  produces  cross- tabulations  of  one  card  column  with 
another  and  provides  summary  computations  such  as  Chi-Square,  means,  me- 
dians, Kendall's  Tau^,  and  percentage  distributions.     Up  to  30  basic  tables 
may  be  produced  at  one  time.     These  30  tables  may  be  computed  for  any  num- 
ber of  designated  sub-samples  (optional  with  each  run).     For  example,  sep- 
arate tables  can  be  prepared  for  white  males,  white  females,  non-white 
males,  and  non-white  females,  or  any  mutually  exhaustive  set  of  categories. 

The  data  input  to  this  program  originates  on  punched  cards  which  are 
then  converted  to  magnetic  tape.     In  certain  cases,  a  parameter  paper  tape 
must  also  be  prepared. 

The  tabular  output  contains  the  following  information: 

1.  The  cross-tabulation  matrix  (frequencies). 

2.  Means  and  medians  for  each  row  computed  on  responses  0-9. 

3.  Chi-Square  with  its  significance  level  computed  for  each  table,  ignor- 
ing blank  responses, 

4.  Kendall's  Tau^  with    "Z"    score  and  test  of  significance  computed  for 
each  table,  ignoring  blank  responses. 

5.  A  percentage  distribution  for  each  matrix,  ignoring  blank  responses. 
Normally  this  distribution  is  computed  for  each  row  from  the  row  totals, 
but  the  option  is  available  to  compute  the  distribution  from  the  grand 
total. 

Additional  information  may  be  found  in  Programming  Note  No.  74,  (Second 
Revision), 
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NC  GB  09  -  Analysis  of  General  Linear  Model 

This  program  will  perform  any  of  the  usual  computations  desired  in 
analyzing  a  specific  multiple  regression,  correlation,  or  analysis  of 
variance  problem  concerning  the  linear  model, 

Y      -  X     B       +  €       -  [y,y,« .  .y]  , 
nxp       nxq  qxp       nxp       •'V2  ^ 

where    Y    is  a  matrix  of  observed  dependent  variables, 

X    is  a  matrix  of  observed  independent  variables  of  rank    q  , 

B    is  a  matrix  of  linear  regression  parameters,  each  column  vector 

corresponding  to  a  different  dependent  variable,  and 
€    is  a  matrix  of  column  vectors  of  random  normal  errors  with  mean 

0  and  variance- covariance  matrix 

nxn 

Data  may  be  specified  as  belonging  to  any  of  ten  groups  and  to  any  lev3l 
within  that  group.  The  analysis  is  performed  for  each  level  within  each 
group.     The  ordering  of  data  is  immaterial. 

The  input  data  to  this  program  originates  on  punched  cards.  The 
output  includes  the  following  information: 

1.  For  the  parameter  values  set  for  all  optional  output, 

a.     The  estimates  of  the  regression  coefficients  with  their  standard 
deviations  and  appropriate  values  of  student's     t    statistic  for 

testing  individual  regression  coefficients  equal  to  zero. 

2 

h.    The  multiple  correlation  coefficient    R  (not    R    )  . 
c.    The  estimate  of  the  error  standard  deviation,  i.e.,  the  standard 
error . 

2.  For  each  null  hypothesis  to  be  tested. 

a.    The  sums  of  the  cross-products  matrix. 
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NC  GB  09  (cont.) 


b.    The  analysis  of  variance  table  for  the  hypothesis. 
3.    In  the  case  of  missing  values. 

a.  The  number  of  times  variables    i    and    j    appear  together  in  a 
data  set. 

b.  The  means  and  standard  deviations  of  variables    i    and    j  based 
on  those  data  sets  in  which  the  variables  appeared  together. 

c.  The  linear  correlation  coefficient  for  variables    i    and    j  . 
The  number  of  dependent  variables  used  in  this  program  must  be  less 

than  20.    The  number  of  independent  variables  must  be  less  than  50  for  the 
regression  analysis  or  analysis  of  variance.    If  only  correlations  are 
requested,  the  ntmiber  of  independent  variables  may  be  less  than  63. 
Additional  information  may  be  found  in  Programming  Note  No.  86. 


NC  GB  15  -  Multipla  Regression  Analysis  for  Models  with 
Categoric  Predictors. 

This  IT  program  is  designed  to  provide  aspeciallj?  fast  and  versatile 
analysis  of  the  regression  model  when  categoric  or  binary  coded  predictors 
are  used.     It  is  especially  suited  for  survey- type  data  as  used  in  the 
social  sciences.     Up  to  11  independent  variables  and  as  many  as  46  predic- 
tor variables  may  be  included  in  a  single  run.     Each  set  of  regression 
equations  and  the  restricted  regression  equations  for  sub-hjrpostheses  are 
computed  for  each  dependent  variable. 

Regression  equations  are  of  the  form, 

Sub-hypotheses  are  of  the  form, 

@.  =  8.  =  ...  =  e,  =  0 

1      J  k 

where    i,  j,  k    are  appropriate  subscripts. 

Additional  detailed  instructions  on  this  program  may  be  found  in 
Programming  Note  No.  109. 
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NC  GF  01  -  Varimax  Rotation 

Input  to  the  varimax  rotation  program  consists  of  a  sequence  of  n 
eigenvalues  followed  by  an    n  x  m    factor  matrix,  where  the  factors  have 
been  normed  to  the  corresponding  eigenvalues .    The  program  maximizes  the 
sum  of  the  fourth  power  factor  loadings  for  the  matrix  by  performing  or- 
thogonal rotations  on  the    r    largest  factors,  where    2  i  r  <  n  . 

Input  is  normally  magnetic  tape  data  prepared  by  NC  FB  06  (Eigen- 
values and  Eigenvectors  of  a  Symmetric  Matrix)  but  it  may  also  be  pre- 
pared on  punched  paper  tape. 

The  output  from  the  program  contains  the  following: 

1.  The  original  n  x  r^  factor  matrix,  A,  with  the  corresponding  eigen- 
values (if  input),  (r^  ■  the  upper  limit  of  the  number  of  factors  to 
be  rotated) 

2.  The  communalities  over  the  first    r  factors. 

3.  The  varimax  solution  matrix,  A  ,  over  the  first    r    factors  with  the 

r 

sum  of  squares  of  the  factor  loadings. 

4.  (Optional)    The    n  x  r    transformation  matrix,  T.     (AT  ■  A^.)  . 

5.  Items  2,  3,  4  are  repeated  in  sequence  for  each  successive  value  of 

r  ,  from  the  lower  and  upper  limits  of  the  number  of  factors  to  be 

rotated,  r,  and    r  respectively. 
1  u 

This  program  will  take  an  input  matrix  of  a  maximum  order  of  100. 

The    r,    and    r      values  must  range  as  such:    2  i  r,  <  r    ^  n  . 
1  u  °  1  u 

Additional  detailed  information  may  be  found  in  Programming  Note  No. 
58,  and  the  following  general  references: 

Kaiser,  Educational  and  Psychological  Measurement,  vol.  XIX,  no. 3, 
1959,  pp.  413-420. 

Kaiser,  Psychometrika ,  vol.  XXIII,  1958,  pp.  189-200. 


NC  HA  01  -  IT  Language  Linear  Programming  Routine 

The  linear  programming  problem  involves  the  solution  of  a  system  of 
linear  inequalities  in  which  the  number  of  problem  variables  exceeds  the 
number  of  inequalities  and,  hence,  leads  to  an  infinite  number  of  possible 
solutions.     The  optimization  of  a  linear  function  of  the  variables  is  then 
used  as  a  criterion  for  the  determination  of  the  optimal  solution  to  the 
problem. 

The  general  problem  may  be  expressed  as  follows: 

Given  coefficients  of  the  linear  system  of  inequalities    a^.^   ,  cost 

coefficients    c,   ,  and  restrictions    b^   ,  determine  the  variables  x, 
J  j  J 

such  that  the  objective  function 

m 

T  ex. 

j-1    J  J 

is  minimized  subject  to  the  linear  constraints 

m  , 

V,    a..x.  ^  b.  "  =  l,...,n 

j=l    'J  J        J  m  ^  n 

Xj  ^  0  j  =  1, . . . ,m. 

This  particular  linear  programming  routine  uses  the  Simplex  Method 
computational  scheme  to  determine  a  solution  for  the  problem,  if  it  exists. 
It  will  handle  a  problem  with  a  maximum  of  39  restraints  and  124  variables 
(including  slack  variables) . 

Input  to  this  program  must  be  on  paper  tape.    The  output  consists  of 
the  following  information: 

1.  The  objective  function  and  constraints. 

2.  The  final  simplex  tableau. 
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NC  HA  01  (cont.) 


3.     Tb.e  feasible  solution  (if  it  exists).     If  there  is  no  solution,  this 
is  indicated. 

Additional  information  on  this  routine  may  be  found  in  Programming 
Note  No.  67. 

A  linear  programming  routine  capable  of  handling  much  larger  problems  is 
also  available  at  the  Center.    However,  this  routine  was  obtained  from  an 
outside  source  and  is  not  completely  operational.     The  Center  staff  should 
be  consulted  before  any  attempts  are  made  to  use  this  second  routine. 
A  general  reference  on  linear  programming  is: 
Gass,  S,  I.,  Linear  Programming,  McGraw  Hill,  New  York,  1958. 


NC  HC  01  -  Critical  Path  Scheduling  Routine 

The  critical  path  scheduling  routine  is  a  network  analysis  program 
similar  to  the  well-known  PERT-PEP  routines. 

Given  the  precedence  matrix  of  jobs  in  a  project,  a  distribution  of 
their  minimum  and  maximum  completion  times,  and  costs,  the  critical  path 
scheduling  routine  produces  a  set  of  tables  for  a  range  of  possible  pro- 
ject durations  from  maximum  time  at  least  cost  to  minimum  time  at  great- 
est cost.    Each  table  contains  the  costs,  the  earliest  and  latest  start 
and  completion  times,  and  the  total  and  free-float  times  for  each  job  in 
the  project.    Jobs  are  called  critical  when  the  total  float  is  zero,  i.e., 
when  the  job  duration  is  equal  to  the  total  time  available  for  the  job. 
If  a  project  contains  critical  jobs,  then  it  contains  at  least  one  con- 
nected path  of  critical  jobs  through  the  precedence  matrix  from  start  to 
finish.     This  is  called  the  critical  path.     This  routine  is  limited  to  a 
maximum  of  3000  jobs  and  to  a  maximiom  of  1000  nodal  points  in  the  network. 

An  example  of  the  output  from  a  critical  path  run  follows.     The  first 
page  is  an  arrow-diagram  of  the  problem.     The  second  page  is  one  of  the 
tables  produced  and  the  summary. 

Additional  detailed  information  may  be  found  in  Programming  Note  No. 
Ill  and  in  the  following  general  reference; 

"Critical  Path  Planning  and  Scheduling,"    Mauchly  Assoc.,  Inc.,  in 
Proceeding  of  the  Eastern  Joint  Computer  Conference,  1959. 


APPENDIX  C  -  Arrow  Diagram 
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NC  IL  07  -  Record  Rearrangement  Program 

This  program  is  designed  to  produce  a  single-card  record  from  a  mul- 
tiple-card record  through  any  selected  combination  of  column  transfers, 
column  duplications,  recoding  of  punches  within  a  column,   forming  combina- 
tion codes  from  the  punches  in  several  columns  and/or  the  insertion  of 
alphanumeric  characters  into  given  columns.     The  specific  purpose  of  this 
routine  is  to  provide  an  easy  method  for  the  generation  of  data  tapes  for 
NCGA09,   the  standard  questionnaire  analysis  program  and  NCGB16 ,   the  re- 
gression analysis  program  for  survey  data;  however,  it  has  application  in 
general  data  tape  modification  and  preparation. 

Additional  detailed  information  for  this  routine  may  be  found  in 
Programming  Note  No.  89. 


NC  MA  04  -  Sort-Merge  Generator 

The  sort-merge  generator  is  a  program  developed  by  the  Bureau  of  the  Census 
and  provides  the  UNIVAC  1105  with  the  capability  of  handling  a  wide  variety  of 
sort-merge  operations  with  a  minimum  of  programming.    If  this  generator  program 
is  provided  with  a  limited  number  of  parameters  describing  a  particular  sort- 
merge  problem,  it  will  generate  a  computer  program  capable  of  performing  the 
required  task. 

Sort  programs  can  be  generated  to  handle  alphabetic  and/or  numeric  infor- 
mation occurring  in  unit  groups  of  up  to  120  characters.    With  a  unit  group,  a 
maximum  of  36  character  positions  may  be  selected  as  keys  for  reordering  (sort- 
ing) the  units  of  information. 

Merge  programs  may  also  be  generated  which  will  allow  the  user  to  merge 
similarly  ordered  lists  of  information  into  a  composite  list  of  information  of 
like  order.    Character  limits  for  unit  groups  and  keys  are  the  same  as  above. 

Additional  information  on  the  sort-merge  generator  can  be  found  in  the  Bu- 
reau of  the  Census  Computer  Systems  Manual  located  in  the  Computation  Center 
library. 
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NC  ME  02  -  Readability  Analysis,  Cards 

This  readability  analysis  program  is  designed  to  compute  the  Danielson 
Bryan  and  the  Farr- Jenkins-Paterson  (F-J-P)  readability  scores  of  English 
prose.    The  prose  selections  to  be  scored  are  input  to  this  program  on 
punched  cards.     Output  from  the  program  consists  of  the  following  infor- 
mation: 

1.  The  selection  number. 

2.  The  totals  for  sentences,  words,  monosyllables,  characters,  and  spaces. 

3.  The  variables  of  the  Farr- Jenkins-Paterson  formula:  sentence  length 
and  per  cent  of  monosyllable  words. 

4.  The  variables  of  the  Danielson-Bryan  formula:     characters  per  space 
and  characters  per  sentence. 

5.  Readability  scores  according  to  the  scaled  F-J-P  formula  (as  revised 
by  Richard  Dale  Powers),  and  the  scaled  and  unsealed  Danielson-Bryan 
formulas . 

6.  Verbal  analogues  are  given  to  the  scaled  F-J-P  formula  and  the  scaled 
Danielson-Bryan  formula.     The  verbal  descriptions  are  made  according 
to  this  table: 

90  -  100  -  very  easy  (3rd  grade  level) 
80  -    89  ■  easy  (4th  grade  level) 
70  -    79  -  fairly  easy  (5th  grade  level) 
60  -    69  -  standard  (6th  grade  level) 

50  -    59  -  fairly' difficult  (junior  or  high  school  level) 
30  -    49  -  difficult  (high  school  level) 
0  -    29  -  very  difficult  (college  level) 
Additional  information  may  be  found  in  Prograiraning  Note  No.  103  and 
in  the  following  reference. 

Danielson,  Wayne  A.  and  Bryan,  Sam  Dunn,  "Computer  Automation  of  Two 
Readability  Formulas,"  Journalism  Quarterly,  40:2,   (Spring,  1963). 
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NC  ME  03  -  Readability  Analysis,  AR  Tapes 

This  Readability  Analysis  program  is  designed  to  compute  the  Danielson- 
Bryan  readability  scores  of  Associated  Press  news  stories  stored  on  magnetic 
tape.    The  input  to  this  program  is  a  magnetic  tape  of  Associated  Press 
news  stories  and  control  data  punched  on  cards.    The  output  from  the  pro- 
gram consists  of  the  following  information: 

1.  The  selection  number. 

2.  The  category  according  to  the  control  card. 

3.  The  first  ten  characters. 

4.  The  totals  for  numbers  of  characters,  spaces,  and  sentences. 

5-    The  variables  of  the  Danielson-Bryan  formula:     characters  per  space 
and  character  per  sentence. 

6.  The  Danielson-Bryan  readability  scores.    The  unsealed  score  gives  the 
predicted  school  grade  level  at  which  the  selection  can  be  comprehended. 
The  scaled  score  corresponds  to  the  unsealed  score  but  varies  over  a 
wider  range  usually  between  0-100, 

7.  A  verbal  analogue  to  the  scaled  scores.    For  example,  "This  AP  story 
is  easy."    The  verbal  descriptions  are  made  according  to  this  table: 

90  -  100  ■  very  easy  (3rd  grade  level) 
80  -    89  -  easy  (4th  grade  level) 
70  -    79  ■  fairly  easy  (5th  grade  level) 
60  -    69  ■  standard  (6th  grade  level) 

50  -    59  -  fairly  difficult  (junior  or    high  school  level) 
30  -    49  ■  difficult  (high  school  level) 
0  -    29  -  very  difficult  (college  level) 
This  program  is  limited  to  one  data  tape  per  run. 

Additional  information  may  be  found  in  Programming  Note  No.  104  and 
in  the  following  reference. 

Danielson,  Wayne  A.  and  Bryan,  Sam  Dunn,  "Computer  Automation  of  Two 
Readability  Formulas."  Journalism  Quarterly,  40:2,  (Spring  1963). 
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NC  MF  01  -  Key-Word- In- Context  (KWIC)  Indexing  Program 

The  Key-Word- in- Context  (KWIC)  indexing  program  produces  an  alphabetic 
index  of  key  words  from  a  prepared  list  of  docxament  titles.     Each  word  in 
a  title  that  is  not  on  a  prescribed  common-word  list  is  considered  a  key 
word;  each  index  entry  consists  of  the  key  word  shown  in  context  with  that 
portion  of  the  title  which  contains  it  and  a  number  Xi^hich  identifies  the 
document.     Such  an  entry  will  appear  for  every  key  word  in  every  title. 

The  KWIC  indexing  program  also  provides  a  high-speed  printer  listing 
of  the  bibliography  of  titles  to  be  indexed.  On  this  listing,  each  entry 
contains  the  complete  title,  source  information,  and  the  document  number. 
This  list  is  ordered  by  document  number. 

The  common-word  list  used  by  this  program  can  be  modified  as  desired 
or  a  different  list  substituted.  A  print-out  of  the  common-word  list  can 
also  be  obtained  as  an  optional  output. 

An  example  of  output  from  a  Kl'JIG  run  follows.     The  first  page  is  a 
sample  of  the  index;  the  second  page  is  a  sample  of  a  document  list. 

Additional  operational  information  may  be  found  in  Programming  Note 
No.  92. 
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NC  MF  02  -  Magnetic  Tape  Context  Scanner  Program 

The  magnetic  tape  context  scanner  program  searches  alphanumeric  data 
taps  for  partial  or  exact  duplication  of  words  specified  by  a  control  word 
list  (data  may  be  a  story,  poem,   listing,  etc.)'    Whenever  such  a  duplica- 
tion is  encountered  on  the  data  tape,   the  duplicated  control  word  plus  a 
stated  number  of  words  which  precede  it  and  follow  it  on  the  data  tape  are 
output. 

Input  to  this  program  is  a  magnetic  tape  containing  the  alphanumeric 
data  and  a  paper  tape  parameter  lists  which  specifies  the  number  of  words 
of  context  that  are  to  be  output  with  each  duplication  of  a  control  word, 
the  type  of  duplication,  partial  or  exact,  that  is  to  be  used,  and  the  list 
of  control  words. 

As  an  example,  one  might  be  interested  in  an  author's  views  on  a  given 
subject.     By  scanning  a  data  tape  of  his  works  with  this  program  while  us- 
ing a  control  word  list  of  key  words  closely  related  to  the  subject,  one 
would  obtain  a  listing  of  the  occurrences  of  all  such  words  on  the  data 
tape  in  the  context  in  which  they  occurred.     Hence  by  studying  this  list- 
ing, a  fairly  accurate  idea  of  the  views  of  the  author  concerning  the  given 
subject  could  be  formed. 

Additional  operational  information  on  this  routine  can  be  found  in 
Programming  Note  No.  98. 


NC  OA  02  -  IT  Plotting  Program 

This  program  provides  users  of  the  computer  with  the  facility  to  plot 
a  graph  of  their  computer  output  on  the  off-line  high  speed  printer.  Input 
to  the  program  is  an  HSP  output  tape  from  an  IT  Language  Program  or  any 
HSP  output  tape  written  in  the  four  column  IT  output  format.     The  program 
produces  a  graph  for  printing  on  the  high  speed  printer  with  one  indepen- 
dent variable  and  up  to  three  dependent  variables.     Seven  hundred  and 
fifty  lines  of  data  can  be  plotted  at  one  time. 

An  example  of  an  output  is  given  on  the  following  page.     This  figure 
gives    f(x)  =  sin  x,  f(x)  =  cos  x,  and    f(x)  =  l.llx  -  I,  plotted  over  a 
range    0      x  ^  n  . 

Programming  Note  97  contains  additional  detailed  information  on  this 
•routine . 
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NC  ZA  09  -  Multicomponent  Distillation  Calculation 

This  program  provides  a  general  solution  to  the  multicomponent  rec- 
tification problem.     The  system  of  nonlinear  simultaneous  equations  rep- 
resenting the  whole  column  are  solved,  component  by  component,  by  an 
iterative  procedure  which  requires  only  an  assumption  of  a  column  tem- 
perature distribution  for  light  hydrocarbon  rectifications  as  a  starting 
condition.     The  successive  iterations  then  correct  this  temperature 
distribution.     The  number  of  iterations  depends  on  the  precision  required, 
but  generally  four  or  five  iterations  are  adequate „ 

The  program  as  it  stands  will  accept  problems  with  up  to  10  compo- 
nents and  102  trays  (counting  reboiler  and  condenser).  The  method  used 
is  described  in  the  following  technical  article: 

Amundson,  N.  R.  and  A,  J,  Pontinen,  "Multicomponent  Distillation 
Calculations  on  a  Large  Digital  Computer,"  Industrial  and  Engineering 
Chemistry,  vol.  50,  page  730,  May,  19  58, 

Additional  operational  information  can  be  found  in  Programming  Note 
No.  114. 
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